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Abstract 

A novel dynamical model for the study of operational risk in banks and 
suitable for the calculation of the Value at Risk (VaR) is proposed. The 
equation of motion takes into account the interactions among different bank's 
processes, the spontaneous generation of losses via a noise term and the efforts 
made by the bank to avoid their occurrence. Since the model is very general, 
it can be tailored on the internal organizational structure of a specific bank 
by estimating some of its parameters from historical operational losses. The 
model is exactly solved in the case in which there are no causal loops in 
the matrix of couplings and it is shown how the solution can be exploited 
to estimate also the parameters of the noise. The forecasting power of the 
model is investigated by using a fraction / of simulated data to estimate the 
parameters, showing that for / = 0.75 the VaR can be forecast with an error 
2 ~ KT 3 . 
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1. Introduction 

The methods developed in the context of statistical mechanics and, more 
in general, in the study of complex systems have found in the last years broad 
application in many different scientific fields. Economic sciences particularly 
benefited from interdisciplinary approaches and borrowed some crucial ideas, 
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powerful tools and techniques [l| from those fields. However, these efforts 
have been devoted almost exclusively to the study of the financial risk [2|, 
and only more recently also other typologies of risk [3] as the operational risk 
[4|, |5j are gaining more and more attention. 

Operational risk is "the risk of [money] loss [in banks] resulting from in- 
adequate or failed internal processes, people and systems or from external 
events" [6|, including legal risk, but excluding strategic and reputation linked 
risks. Let us make an example to clarify the dynamics underlying the gen- 
eration of operational losses; suppose that a material damage in the system 
that controls and authorizes the transactions occurs and is discovered at the 
time t%, but repaired only later at the time t 2 ; a loss equal to the amount of 
money needed to repair the damage is generated at the time t\ in the process 
of machinery servicing, but the failure has likely generated losses delayed up 
to the time £2, because some transactions have failed or have been wrongly 
authorized. This example shows that the different processes may be strongly 
correlated, and that their typical correlations extend over time. 

The primary goal of the management of operational risk is to determine 
the capital charge that the bank has to put aside (e.g. every year) to cover the 
operational losses. The New Basel Capital Accord [6] roughly proposes to set 
this capital to the 15% of the bank's gross income, or to consider the gross 
income per business line and weight each one with a coefficient ranging from 
12% to 18%; we will call these approaches "macroscopic", since they assume 
that the capital requirement does not depend on the details of the internal 
structure of the bank, but only on its size. However, the basic assumption 
of these approaches seems not to be realistic and they do not provide any 
insight on the mechanisms underlying the generation of losses, not allowing 
any practice aimed to foresee or reduce the future losses. 

The New Basel Capital Accord also envisages that each bank is free to 
develop its own approach to the evaluation of the capital requirement as long 
as it satisfies some general requirements. One possible approach is the "mi- 
croscopic" one in which one tries to take into account all the fundamental 
mechanisms involved in the generation of operational losses; it is clear that 
in this framework one should deal not only with variables representing oper- 
ational losses but with heterogeneous variables which strongly depend on the 
particular mechanism under examination. Although these approaches are 
more solidly founded than the macroscopic ones, they suffer from practical 
drawbacks; firstly, it is extremely difficult to introduce a dynamical model 
that couples all the microscopic variables in a realistic way: for this reason 



some attempts in this sense |7j have been done using bayesian networks, 
which allow to model statistical correlations among heterogeneous variables, 
but do not allow to follow the evolution of losses in time. Secondly, the im- 
plementation of such an approach inside a bank requires that all the relevant 
variables should be monitored reliably with a certain frequency, which can 
be an extremely resource demanding task, especially for a small or medium 
sized bank. 

An alternative approach may be to provide a description of loss events 
based on an effective model consisting of much fewer degrees of freedom 
than a microscopic one, but still able to distinguish the internal structure 
of a bank; it is natural to call such an approach "mesoscopic", since one can 
think that the effective variables are obtained by integrating out the details 
contained in the microscopic ones. In the non-macroscopic approaches the 
capital requirement is usually identified with the Value-at-Risk (VaR) over 
one year and with 99.9% level of confidence, i.e. the 99.9 percentile of the 
yearly loss distribution; this implies that the probability of registering a 
loss being greater than the value of the VaR in one year is equal to 0.001 
or, equivalently, that such a loss may occur on average every 1000 years. 
The most widely used non-macroscopic approach is mesoscopic: the Loss 
Distribution Approach (LDA) [8|, |9j, [lOj classifies the loss events in 8 business 
lines (sectors of activity of the bank) and 7 event types (causes of the loss) 
and identifies the relevant variables with the number of losses occurred during 
a certain time horizon (frequency) and with the amount of a single loss 
(severity) for each of the 56 couples (business line, event type); the LDA 
makes use of the distributions of frequency and severity to derive the loss 
distribution over the given time horizon, usually assuming that frequency 
and severity for each process are independent random variables, and thus 
failing in capturing the correlations among different couples. There are some 
proposals of how to take into account the correlations in the context of LDA, 



e.g. among frequency of different couples J9|, [ll|, [12| , frequency and severity 



of the same couple |l3|, frequency and severity of different couples |14J Il5 



or aggregate losses of different couples |14J Il6| . or in different frameworks 



|171 Il8j . but no one has gained a general consensus. It is worth pointing out 
that it is very unlikely that mechanism of loss production involves frequency 
and severity, which should be simply considered statistical tools to model the 
loss distribution over the given time horizon: for this reason it is not obvious 
how to incorporate the time dependence into the LDA framework. 

A different possible mesoscopic approach consists in assuming that the 



effective variables are the degrees of freedom of a dynamical system and pos- 
tulating an effective equation of motion |l9J. The model should be sufficiently 
general to explain the dynamics of loss production in all the banks, but flex- 
ible enough to adapt to the particular internal structure of a specific bank, 
for example by properly tuning the parameters appearing in the equation 
of motion. Once the parameters have been estimated, the advantage of a 
dynamical approach is immediately evident: one may follow the production 
of the losses during time and thus may be able to make predictions about 
the evolution of losses. In the approach presented in this paper the effective 
variables are the amount of losses registered at a certain time in a certain 
process; processes are categories in which losses are classified and depend on 
the specific structure of a bank; examples are material damage, failed trans- 
action or fraud. The equation of motion includes two general mechanisms for 
the generation of losses in a process: the interactions with other processes 
and the spontaneous generation due to a random noise; the possibility that 
the bank invests money to avoid the occurrence of losses is also taken into 
account. As the equation of motion contains a noise term, the loss distribu- 
tion will naturally arise considering several realization of the noise: therefore 
the VaR can be still taken as a measure of the capital requirement. Since 
the different-time correlations play a crucial role, the interaction term is non- 
local in time. Let us point out that, even in the case in which the microscopic 
dynamics of the system was local in time, it is perfectly reasonable to assume 



an effective dynamics that is non-local in time: it is a well known result [20 
that the reduced dynamical system obtained integrating out some degrees of 
freedom from a dynamical system with equation of motion local in time is 
in general characterized by an equation of motion which is non-local in time. 
From this point of view, the non-locality in the equation of motion is justi- 
fied a priori, basing on very general considerations, rather than a posteriori, 
basing on some peculiar features of the loss distributions. 

A dynamical model for operational risk has been already proposed in 
[2ll |22| . and also applied to the study of credit risk [23|. There are some 



important differences between the approach i n |21l. |22| and the one proposed 
in this paper. Firstly, while the dynamics in [2ll . |22j is local in time, the one 
proposed here is not, for the reasons that we have just discussed. Secondly, as 
explained in Section [61 the dynamics proposed here allows the estimation of 
the parameters of the noise. In addition, even if it is possible to show that the 
pro posed dynamics is equivalent to a dynamical generalization of the LDA 
|2l| (see Section E]), it is possible to interpret all the terms in the equation 



of motion as general mechanisms which are responsible for the generation of 
operational losses. From this point of view, it is natural to build a dynamics 
which directly involves the amount of losses registered in each process. Such 
an approach has been introduced in J24J, where also a detailed comparison 
with the framework of frequency and severity is made. The methodological 
advantage is that one has not to make direct assumptions on the shape of 
the loss distribution, but only on the basic mechanisms that generate the 
losses: in a sense, the features of the loss distribution emerge from those 



mechanisms. This is a fundamental difference with respect to |2lLl22j. where 
the form of both frequency and severity distributions are imposed a priori. 

The paper is organized as follows: in Section [5] the model is introduced 
and in Section [3] it is shown that under some hypothesis it can be exactly 
solved; in Section H] it is illustrated how some parameters of the model can 
be estimated from real data; in Section [5] the proposed procedure to estimate 
the parameters is validated by means of simulated data; moreover, the capa- 
bility of model to forecast future operational losses is tested by estimating 
the parameters only from a fraction of simulated data and comparing the 
forecasts made by the model with the remaining part; in Section [H]the model 
is compared with an alternate one, while in Section [7] some conclusions are 
drawn. 

2. The Model 

The model consists of N positive real variables /,(£) that represent the 
amount of loss (in some currency) registered in the process i at the time 
t and that evolve by means of a discrete time equation of motion. The 
variables are coupled through the matrix J which in general is not symmetric: 
Jij 7^ means that ^ is influenced by lj and not vice versa; the equation of 
motion is non-local in time in the sense that, if J^ ^ 0, li(t) depends on 
I j{t — 1), . . . , lj(t — £*■) which are the values that lj takes in the past £*• time 
steps; t*j can thus be thought as an asymmetric time of correlation between 
the variables lj and U. The equation of motion is: 

k{t) = Ramp | J2 J *i C v (*) + 9 * + & (*) ) > ( l ) 



where the ramp function: 

I x for x > 
Rampfa;) = < 

1 for x < 

ensures that /j(t) G M + , VI The positive terms in the argument of the ramp 
function in flTJ) tend to generate a loss, while the negative terms tend to avoid 
the occurrence of a loss. The presence of the ramp function in ([1]) excludes 
the possibility of negative losses which could be interpreted as reserves of 
money put aside to automatically lower future losses. 

Cij(t) simply counts the number of lj(t) greater than zero in the time 
interval [t — £*,-, t — 1]: 

Ca(t)= E ePi(*-*)l> ( 2 ) 

Ks<t*. 

— — *j 

where is the Heaviside function. Eq. (J2J) implies that Cij(t) G {0, 1, . . . , £*■} 
and the coupling term in (JTJ can assume only the values 0, Jy, 2 Jy . . . , £*,-Jy , 
so that, if Jjj t^ 0, /;(£) does not depend on the values of /j(t — s), but only on 
the number of times in which lj(t—s) is greater than zero for s G [£—£!•, i— 1]. 
This means that, if Jy > 0, each loss occurred in the process j between the 
time steps t — £*■ and t — 1 generates a potential loss of amount J^ in the 
process z at time t; on the other hand J^ < means that a loss in the 
process j may help the process i to function properly. Such an interaction 
term implies the following approximation: a potential loss generated by other 
losses does not depend on their amount, but only on their number within a 
certain maximum correlation time. The non-locality in time of (J2J) is crucial 
to take into account the different-time correlations, as pointed out in Section 
[TJ Let us incidentally notice that p]) requires an initial condition consisting 
of a number of time steps equal to the maximum of t* ,-. 

The inhomogeneous external field 9i has two very different interpretations 
depending on its sign; a field term 9i < can be interpreted as the effort 
(investment) made by the bank to avoid the occurrence of losses in the process 
i: in fact the sum of the interaction term and &(£) has to be greater than 
\6i\ to effectively produce a loss. In this scenario the fact that d\ does not 
depend on time implies that the amount of money (per unit of time) to invest 
on each process is chosen a priori and kept fixed for a long period of time, 
rather than dynamically adjusted "on the fly". A field term 9% > could be 
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interpreted as a pathological tendency of the process i to produce losses at 
every time step and thus is undesirable in this context. 

£i(t) is a random noise 5-correlated in time that accounts for sponta- 
neously generated losses, i.e. losses that are not caused by the occurrence of 
other losses; this interpretation implies that it must have a positive support. 
As discussed in detail in Section [21 the analytical results that will be obtained 
are very general, in the sense that they can be easily extended to different 
distributions of the noise, provided it satisfies some very general hypotheses. 
To fix the ideas we choose &(£) to have an exponential distribution: 

p(£i) = \ie~ XA (3a) 

W)) = I; (3b) 

the rationale behind such a choice is the following: as it can be intuitively 
argued, spontaneous losses (like those caused by human errors, or machine 
failures) are relatively rare events: such a behavior can be obtained by setting 
9i < and \$i\ > 1/Aj since the chosen distribution is exponential and the 
majority (~ 63%) of the potential losses generated by the noise are smaller 
than its mean value 1/Aj. Because of the presence of noise in ([T|) li(t) is 
a random variable; from this point of view we say that the model can be 
exactly solved if all the moments of the probability distribution of Zj(t) can 
be calculated. 

The crucial quantity for the study of operational risk is the cumulative 
loss up to the time t: 

Zl (t) = Y,h(s), (4) 

s<t 

which can be taken as an approximated indicator of the capital that should 
be put aside to face operational risk over a time horizon t. 

3. Model Solutions 

In this section it will be shown that, if the structure of the coupling matrix 
J satisfies some peculiar hypotheses, the model can be exactly solved in the 
sense specified in Section [2] and the asymptotic behaviour of Zi(t) can be 
determined. 



We give some preliminary definitions: a process % is said to be influenced 
by a process j if J y 7^ 0; a process i is said to be free if it is not influenced 
by any process (including itself), i.e. J y = 0, Vj. These definitions are 
coherent with the mechanism of the interaction term in ([T|): in fact a loss 
occurred in the process j may cause a loss in the process i only if J y - 7^ 0. 
The hypothesis on the structure of J can be stated in the following way: 
let us associate to each process a node in a graph and, if the process i is 
influenced by the process j, let us draw a directed edge from the node j 
to the node i; the graph obtained considering only the nodes influencing 
directly or indirectly the node i, together with the node % itself will be called 
the subgraph associated to the process i; if the subgraph associated to the 
process % is a directed acyclic graph, i.e. if the edges in the graph do not form 
any closed loop (see [25| for basic definitions about graphs), all the moments 
of the distribution of (h(t)) can be exactly calculated. In such a case we 
say that the subgraph associated to the process i has no causal loops; the 
meaning of this definition can be understood considering a graph with a loop 
like i — > j — > i: in such a case the losses occurred in the process i may cause 
other losses in the process j, which in turn may cause other losses in the 
process i, resulting in a causal loop. If the whole graph associated with the 
coupling matrix J is a directed acyclic graph we say that the matrix J has no 
causal loops: in such a case the subgraphs associated to all the processes have 
no causal loops, and therefore the model can be exactly solved. We remark 
that the absence of causal loops is a commonly accepted hypothesis, e.g. in 
the context of other tools which are widely used to take into account the 
correlations among different process, like bayesian networks |7|, [l3|, |l7|, LIS . 



Only the cases relative to the two simplest subgraphs will be treated here, 
deferring a more general discussion to the Appendix. Let us start with a free 
process i, i.e. the subgraph associated to the process i is just a node with 
no incident edges. In this case the random variable U{t) is independent from 
lj(t'), Vj, t' and the n-th moment of its distribution is simply the average of 
lf(t) over the noise, i.e. the average over the random variable £j(£) (we will 
use d£i(t) as a shorthand for p(£j) d£i(t)): 
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defining 

POO 

m[ n)F (x)= Ramp[x + &(£)] n ^(£) (6) 

Jo 

we have: 

A, <>i < U (7) 

6> ■ + J- if 6», > V ; 

where mf has been used (and will be used in the following) as a shorthand 
for m\ . The mean of lf(t) can be analogously calculated: 



(l*{t)) = mf^m = { J[ 26i 2 - -• ■ - , (8) 



A? 


if ft < 


/}2 , 2 6»i _|_ 2 

y * + — + aF 


if ft > 




so that the variance of li(t) is: 

™u(t) = m))-ut)) 2 

!i( 2 -e AA ) if ft <0 (9) 

if ft > ' 

As expected for a free process, (h(t)) and var Zj(£) do not depend on time, and 
all the moments of the probability distribution of li(t) do not as well; Zi(t) 
is thus the sum oft independent and identically distributed (i.i.d.) variables 
with finite variance and, by means of the central limit theorem, for sufficiently 
large t it has a Gaussian distribution with mean and variance: 

(zi(t)) = t (k{t)) (10a) 

v&t Zi(t) = t varZj(t) . (10b) 

The next step is to repeat the calculation of (J7]) and (Q for the process i 
in the case in which it is influenced only by a single process j and the process 
j is a free process (i <— j). Since in this case li(t) depends through Cij(t) only 
on I j(t — 1), ... , lj(t — £*■), the average over the noise equals to the average 
over the random variables &(£), £-,(£ — 1), . . . , £-,(£ — £*,-): 

POO 

(h(t))= Ramp [J ii Ci;(*)+ ft + &(*)] 11 d^(t - s) d^t) ; (11) 

Jo Ks<t* A 



let us observe that the domain of integration of the variables £j(t— 1), . . . , £,j{t— 
£*■) can be divided in subsets obtained by fixing the value of Cy(t); since the 
events CV,(£) = 0, . . . , Cij(t) = £*• are mutually exclusive and cover the entire 
domain of integration: 



s). (12) 



Each term in the summation on the right hand side of (I12|) is simply the 
probability that Cij(t) = c, i.e. the probability that c elements in the set 
{lj(t — 1), . . . , lj(t — £,*•)} are greater than zero and £*• — c elements are less 
than or equal to zero; since the process j is free, the probability that lj(t) > 
is easily calculated: 



/■oo 

n / <&(*- 


»j /> 

-)=£/ n 

c=0 J Cij{t)-c i <s<t 



/■OO 

Pr[/ 4 (t)>0]= / e[li(t)]dZi(t) 
Jo 
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(13) 



defining 



we have: 



that yields: 



/■oo 

pf(x)= / Q[x + Ut)]dUt) (14) 
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/ n <&(* - s ) = H?) [pf (^)] c t 1 -pf &)]*"* ■ ( ie ) 



Cij(t)-c 1<s<t 

"J 

Using (fT6|) and (J6]) ; (ITT]) becomes: 



&(*)> = J2 / 1 1 <&•(* - s ) / Ram p t cJ ^ + ^ + &(*)] <&(*) 

^•/QjW^ !<,<*.. Jo 
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c=0 

(17) 
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The same line of reasoning leading from (II ip to ( II 7p can be followed to 
calculate the variance: 



var li (t) 



E 

c=0 



(f ) [pjm c [i-pjm th ~ c m^ic^+e, 



- (kit))' 



(18) 

or any moment of the distribution of /»(£). Even in this case Zi(t) is the 
sum i.i.d. variables with finite variance and thus (flOl) is also valid; in the 
Appendix it is shown that (flOl) still holds in the more general case in which 
the coupling matrix J has no causal loops; as a consequence, the non-locality 
of the equation of motion alone is not sufficient to modify the shape of the 
cumulative loss distribution. Actually (TTUj) has a crucial importance: while, 
at least in principle, it is possible to think at an extension of the technique 
used here and in the Appendix to calculate the moments of li(t) also in the 
case in which the matrix J has causal loops, the random variables k(t) for 
different values of t would be neither independent nor identically distributed 
in that case, and ( TTUj) would not hold anymore. Determining the moments 
of Zi(t) by the explicit calculation of the moments of k(s), Vs < t is also 
hopeless, since it would become exponentially complex in t, as shown in the 
Appendix. It is worth noting that (TTTj) . (Tl8|) and their analogous in the 
Appendix reduce the calculation of the mean and the variance of Zj(£) to 
the calculation of mf and m\ and pf , which are the only quantities that 
depend on the particular distribution of the noise. This means that those 
expressions may be easily generalized to a different distribution of the noise 
simply by recalculating mf , m\ and pf from <Q and (IT4"j) . provided that 
the corresponding integrals converge; in particular, while pf is always finite, 
it can be easily shown that mf (m\ ) is finite if and only if the mean (second 
central moment) of the noise is finite. However, if m\ diverges, the central 
limit theorem does not apply and the distribution of Zi(t) at large t is not 
Gaussian. 

4. Parameters Estimation 

In this section a scheme for estimating the parameters of the model from 
real data will be presented. In the more general case 6 and J can be esti- 
mated, but the parameters A of the noise must be known a priori. If the 
graph associated to the matrix J is known and has no loops, i.e. if according 
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to the definition given in Section [3] the matrix J has no causal loops, the 
model can be integrated and the additional constraint imposed by the exact 
solution can be exploited to estimate also A. Let us remark that knowing the 
graph associated with J does not mean knowing the values of the elements of 
J, but only which elements of J are equal to 0, i.e. knowing the relationships 
of influence among the processes. The matrix t* of the times of correlation 
must be known a priori in every case. 

In the context of operational risk real data come in the form of a database 
of historical operational losses; such a database is a collection of loss events 
occurred inside a bank; in order to be suitable for the estimation scheme that 
we are describing, the database must keep track of the amount, the process 
in which and the time at which each loss event occurred. The time resolution 
of the database is identified with the discrete time step of the model and the 
time at which the oldest loss occurred with t — 0, so that the database can 
be thought of as a realization of ([1]). Since in this section there is no risk of 
ambiguity in the notation, the amount of loss registered in the database at 
the time step t in the process i will be denoted with li{t). 

4-1. Estimating 9 

In order to estimate 9i let us look in the database of operational losses 
for the events such that CV,(£) = 0, Vj; assuming that the database is a 
realization of ([T]) we have: 

Zi(t)= Ramp [ft + &(*)] ; (19) 

the probability that Zj(£) = 0, conditioned on the occurrence on such events 
is: 

Pr \k{t) = | Cij{t) = 0, Vj] = Pr % < -9i] , (20) 

where the dependence of £, on t has been dropped since its distribution does 
not depend on time. In order to make a frequentist estimate of the left hand 
side of ( 12 Up one would need a sample of values of li(t), which is obviously not 
possible using a single database which contains only one value of Z, at the 
time t; however, since the right hand side of (120]) does not depend on time, 
also the left hand side must not: 

Pr [k = 0\ dj = 0, V j] = Pr [& < -9 t \ 



[ l Xte-^dti (21) 

Jo 



l-e Al 
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where the left hand side has the meaning of a frequentist estimate from the 
database: 

^-°i^-Mii- ^a-VvS vi)1 - <22) 



9i can be estimated inverting (1211) : 

ft = 1 log (1 - Pr [Z, = | C ti = 0, Vj]) ; (23) 

let us explicitly notice from (123]) that the values of ft estimated in such a 
way are negative. 

Let us make an example using the excerpt of a possible database shown 
in Tab. HJ for simplicity we assume that £*• = 2, Vi and j. Let us suppose 
to be interested in estimating the value of 9\\ according to ( 122]) we need to 
count the events such that Cy = 0, Vj; the counting starts from the first 
time step and proceeds using a moving window of width equal to £* • time 
steps: in this case one starts considering time steps 1 — 2 and subsequently 
moves to 2 — 3, 3 — 4, etc. From Tab. [T] and using ($Z§ we see that for the 
event corresponding to time steps 1 — 2 we have C\j = 0, Vj, meaning that 
Fr \C\j = 0, V j] must be incremented by one. To count the events such that 
li = and C\j = 0, Vj one has to consider one more time step: as h(3) = 0, 
also Fr [(Zi = 0), (Cy = 0, V j)] must be incremented by one. 

4-2. Estimating J 

The estimation of Jij uses the same line of reasoning followed to estimate 
6i from which differs only by the fact that it is based on different kinds 
of events; in this case we look for the events such that Cij(t) = c with 
c = 1, . . . , t*j and Cikit) = 0, k ^ j; for such events flTJ reads: 

l t (t) = Ramp [cJy +9 { + £i(t)] ; (24) 

the probability that Zj(£) = 0, conditioned on the occurrence on such events 

is: 

Pr [h(t) = | C i3 (t) = c, C ik {t) = 0, k ^ j] = Pr [& < -9 { - cJy] ; (25) 
proceeding like in (j2~Ij) we find: 

Pr [U = | dj = c, C ik = 0, k^j] = Pr [& < -ft - c^-] 

p—&i — cJij 

= / A^e-^^ (26) 

Jo 
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Table 1: Excerpt of six time steps of a possible database composed by five processes, where 
each row corresponds to a different time step and each column to a different process. Losses 
are indicated by •, while empty spaces correspond to zero losses: e.g. both h(3) and ^(4) 
are different from zero. 



1 

2 
3 
4 
5 
6 



where the left hand side of (126]) has again the meaning of a frequentist esti- 
mate: 



Fr [C^ = c, C ifc = 0, fc ^ j] 

" (27) 



and Jij can be estimated inverting (126 



c 



-^ + \- log (1 - Pr ft = | dj = c, C lk = 0, k ^ j}) 
A,- 



(28) 



Let us notice that (128|) puts a subtle constraint on the parameters that can 
be estimated: cJij + 0, < 0, Vc; if 0j < (which is the case we are interested 
in) this translates into £*,•</# < |#i|. 

In the context of operational risk the constraints imposed by (j23|) and 
( f28|) mean that the bank is exerting a control on the processes so strong 
that the interactions alone are not sufficient to generate a loss; in such a 
scenario a loss occurs when the noise is greater than the threshold set by the 
negative 9,i and the interaction term (if J^ > 0) provides a mechanism to 
dynamically lower this threshold. In the case of a practical implementation, 
the soundness of these contraints should be certainly checked by experts in 
the organizational structure of the bank. 

Also in this case Tab. [T]can be used to clarify how the events relative to 
the estimation of J^ are identified. Let us suppose to be interested in the 
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estimation of J 43 ; from (I2"7j) we see that we need to count the events such 
that C43 = c, Cik = 0, for k 7^ 3; from Tab. [I] we see that time steps 2 — 3 
contribute to the case in which c = 1, while time steps 3 — 4 contribute to 
the case in which c = 2, meaning that both Fr [C43 = 1, C^ = 0, k 7^ 3] and 
Fr [C43 = 2, C4A; = 0, k 7^ 3] must be incremented by one. Since ^(4) = 0, 
also Fr [(/ 4 = 0), (C43 = 1, C4& = 0, k 7^ 3)] is incremented by one, while it 
is not the case for Fr [(7 4 = 0), (C43 = 2, C^ = 0, k 7^ 3)], since ^(5) 7^ 0. 

^..J. Estimating A 

In order to estimate the value of A; the exact expression of {h(t)) will 
be exploited; since it is available only in the case in which the subgraph 
associated to the process i has no loops, the discussion will be restricted to 
this case. If this is true for all the processes, the whole graph associated with 
the coupling matrix J has no loops and A, can be estimated Vi. Let us start 
with the case of a free process i; using ([7]), (123|) and f fTUj) we have: 

A, = -^— (1 - Pr [h = 1 dj = 0, V j]) , (29) 

where the case 9% < of (J7J) has been considered since ( 12"3"|) does not allow 
positive estimates of #;. In (129]) (zi(T)) has been replaced by the actual value 
calculated from the database of operational losses basing on the following ar- 
gument; Zi(t)/t is the sample average of the random variables /j(t) which are 
i.i.d. with finite mean given by (JTj); according to the law of large numbers 
Zi(t)/t -> (hit)) that, together with ([T0l) , yields Zi(t)/t -)■ (zi(t))/t; as dis- 
cussed at the end of Section [31 this argument only applies to all the cases in 
which the coupling matrix J has no loops. 

For a process i that is influenced only by a single free process j, ([1] 
flU, (T58D and flTD]) yield: 

A * = ~m 5Z(l-Pr[Z, = 0| C 4 , = c, C lfe = 0, k ^ j]) 



Zi(T) 



c=0 



^)(1-Pr[/ J = 01^ = 0, Vj]) c 



■(Pi[l i = 0\C ij = 0, Vj]) £ -- C , (30) 

where again the case #j < from ([7]) has been considered and (zi(T)) has 
been replaced by Zj(T). Once A, has been estimated through (129)) or ( 130]) 
and inserted into f[2"3l and (I2"5j) . ^ and JJy can be also estimated. 
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In the more general case in which the coupling matrix J has no loops 
(TTU|) still applies and (12§|) and (130 j) can be extended using ( I23p . (j28p and the 
results in the Appendix. In the most general case in which the matrix J has 
causal loops, A, may be elicited in an empirical way by assessing the mean 
value of a spontaneous loss in the process i, or by inverting (Tl5l) and assessing 
the probability that a spontaneous loss occurs in the same process. 

5. Results 

In order to check the consistency of the method proposed to estimate the 
parameters of the model we go after the following steps: i) we let the system 
evolve for T time steps, ii) interpret the resulting trajectory (which will be 
called original trajectory in the following) as a database of operational losses 
and estimate the parameters, iii) insert the estimated parameters in ([I]) and 
sample a great number of trajectories, iv) compare z*(t), the cumulative 
loss of the original trajectory, with the average of Zi(t) over the sample of 
trajectories. Since from (128]) there may be up to £*• different estimates of 
Jij one may use the mean of the estimated Jy or sample from them. There 
are two reasons to perform the comparison basing on the cumulative losses 
Zi(t) rather on k(t): first, as already pointed out in Section [21 Zi(t) is the 
quantity of interest in the context of operational risk; second, at least in the 
case in which J has no causal loops, Z{(t) has the peculiar property to be 
self-averaging in time, i.e. Zi(t) — > (zi(t)) (see Section [4.3p . being perfectly 
suitable to be compared with its average. 

A slightly modified version of the previous strategy allows to test for 
the forecasting capability of the model as well: it is sufficient to estimate the 
parameters using only the first f'T (with < / < 1) time steps in the original 
trajectory, but still sampling trajectories lasting T time steps; in this way we 
try to reproduce the behavior of Zi(t) in the last (1 — f)T time steps ignoring 
the information contained in the same time steps of the original trajectory. 
For / = 1 the test on the forecasting capabilities reduces to the consistency 
check. In the case in which the matrix J is known to have no causal loops it 
is not necessary to simulate the trajectories using (JTJ), but all the quantities 
of interest such as (zi(t)) or var Zi(t) may be rather directly calculated by 
means of the exact solutions. 

Let us briefly comment on the parameters chosen to generate the original 
trajectory. From (Tj[|) we see that 9{ may be chosen to be the unit of measure- 
ment of U by properly rescaling 9i, Jij and the noise, so that one can take 
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Figure 1: Graph associated with the matrix J. The nodes labeled 1 and 2 correspond to 
free processes; the process 3 is influenced only by a free process (node 1), while the process 
5 is influenced by two free processes (nodes 1 and 2); the process 4 is influenced only by 
a process (node 3) which is influenced only be a free process (node 1). 

9i = ±1, the sign being the same of 9i before the rescaling; we are forced to 
choose 9i = —1, Vi because (123]) does not allow the estimation of positive #,. 
We stress that the number of processes (N = 5 in this case) does not 
play any significant role; actually the only relevant element is the complexity 
of the subgraphs contained into the graph associated with the matrix J: for 
this reason we start from the simplest subgraph (a free process) and move 
on considering progressively more complex ones. In fact, the structure of 
the matrix J is chosen to encompass all the cases explicitly treated in the 
Appendix: free process (i = 1,2), process influenced only by a free process 
(i — 3), process influenced only by a process which is influenced only by a 
free process (i = 4) and process influenced by two free processes (i = 5). 
The graph representing the influences among the processes is shown in Fig. 
[1] since it has no loops it is possible to estimate also A. In order to satisfy 
the constraint imposed by (128]) we choose: 
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(31) 



and t*j = 5, for i and j such that Jij ^ 0. The values Aj are chosen basing 

on the following argument; the more events suitable for the estimation of 9 
and J are found, the more the estimated values will be reliable; the events 
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— * 

suitable for the estimation of 9 (see ( )2Up ) are more likely to be found in a 
database with a low density of losses, however, if this density becomes too 
low, there will be no events left to perform the estimation of J (see (125]) ). 
We find that a reliable estimation of 9 and J is obtained using: 

A = (2, 3, 5, 5, 5) (32) 

and T = 2 ■ 10 5 . The initial condition used is: li(t) — 0, for i — 1, . . . , 5, 
corresponding to a state in which all processes do not generate losses and 
thus can be considered perfectly functional. 

For / = 1 the parameters are estimated with the following relative errors: 

59- (0.0033, 0.0029, 0.0390, 0.0074, 0.0343) 

SJ 31 ~ 0.0959 5J 43 ^ 0.1313 

5J 51 ~ 0.0377 5J 52 ~ 0.1466 
5\ ~ (0.0030, 0.0032, 0.0407, 0.0022, 0.0337) , 

while for / = 0.75: 

59- (0.0044, 0.0032, 0.0468, 0.0094, 0.0369) 
5J 31 ~ 0.0659 5J i3 ~ 0.0009 
5J 51 ~ 0.0566 5Jr o2 ~ 0.1520 

5A ~ (0.0033, 0.0052, 0.0445, 0.0012, 0.0332) . 

In Fig. [2] we compare z*(t), the cumulative loss of the original trajectory 
(solid line) with (zi(t)), the average over the noise of Zi(t) obtained estimat- 
ing the parameters from the original trajectory and calculated with ([7]), f[T7|) . 
(TA~6]) and f[Q|) . for / = 1 (dashed line) and / = 0.75 (dashed-dott ed line); 
the semi-transparent regions span one standard deviation a Zi (t) = ^/var Zi (t) 
around (zi(t)) and have been calculated by means of ()9]), (1T8|) and the ana- 
logues of (1A.6J) and (1A.9|) for the variance. Since both the process % = 1 and 



the process i = 2 are free and their results are qualitatively identical, we 
only show those relative to the process i — 1; only the last 10 4 time steps are 
shown for the sake of readability. The fact that z*(t) is reproduced for all 
the processes with an error which is far less than one standard deviation for 
/ = 1 proves the consistency of the estimation of the parameters proposed 
in Section HI the same result for / = 0.75 shows that the model exhibits the 
capability to forecast the cumulative losses in the last quarter of the original 
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Figure 2: z*(t), the cumulative loss of the original trajectory (solid line) and (zi (£)), 
the average of Zi(t) over the noise obtained estimating the parameters from the original 
trajectory, for / = 1 (dashed line) and / = 0.75 (dashed-dotted line); the limits of the dark 
(light) semi-transparent region are (zi(t)) ±cr Zi (t) for / = 1 (/ = 0.75); the darkest region 
is the overlap between the regions relative to / = 1 and / = 0.75. For all the processes 
z*(t) is reproduced with an uncertainty which is far less than cr Zi (t) and the error regions 
overlap almost completely. 



trajectory. Moreover, the error regions relative to / = 1 and / = 0.75 over- 
lap almost completely for all the processes: this means that all the relevant 
information about the parameters of the model is contained in the fraction 
of the database used for the estimation and that the information contained 
in the remaining part is redundant. 

In Fig. [3] we show z%(T) (dashed-dotted line) and the Gaussian distribu- 
tion of Zi(T) obtained estimating the parameters from the original trajectory, 
for / = 1 (solid dark line) and / = 0.75 (solid light line). Fig. [3] refers to 
the process i = 4 since its associated subgraph is the more complex; the 
results obtained for the other processes are completely analogous. We notice 
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za{T) 

Figure 3: z|(T), the cumulative loss of the original trajectory at the final time step 
(dashed-dotted line) and the Gaussian distribution of Z4 (T) obtained estimating the pa- 
rameters from the original trajectory, for / = 1 (solid dark line) and / = 0.75 (solid 
light line) . The two distributions overlap almost completely and their peaks correspond to 
z\(T). The relative error of the VaRs over the time horizon T and with level of confidence 
99.865 for / = 1 (dashed dark line) and / = 0.75 (dashed light line) is ~ 10~ 3 . 



that the two distributions overlap almost completely and that their peaks 
correspond to z\{T). 

The VaR over the time horizon T and with level of confidence 99.865 
can be easily calculated for a Gaussian distribution, being equal to (zi(t)) + 
3 cr 2 -(t); in Fig. [3] the VaRs of the process 4 for / = 1 (dashed dark line) and 
/ = 0.75 (dashed light line) are shown to be almost identical: their relative 
error is < 10~ 3 . In Tab. [2] the VaRs are reported for / = 1 and / = 0.75, 
together with their relative error (TVaR which is ~ 10~ 3 for all the processes. 

As pointed out in Section [3], if the variance of the noise is not finite, the 

distribution of Zi(t) for large t is not Gaussian; using the generalized limit 

theorem [26), it is possible to show that the distribution of Zi(t) must be 

positively skewed and heavy-tailed, in agreement with some empirical results 



27) . However, the analysis of the model with infinite variance of the noise 



poses both conceptual and computational problems: since the variance of 
li(t) diverges as well, another reliable measure of the width of the distribu- 
tion must be found to quantify the goodness of the model predictions; the 
relationship between the parameters of the distribution of li(t) and the pa- 
rameters of the cumulative loss distribution Zi(t) is entirely different from 
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Table 2: VaRs over the time horizon T and with level of confidence 99.865 for the process 
i calculated from the cumulative losses z% (T) obtained estimating the parameters from the 
original trajectory, for / = 1 and / = 0.75; <5VaR, the relative error between VaR-' - and 
VaR/ =0 " 75 is ~ 10~ 3 for all the processes. 

i VaR /=1 VaR /=a75 <5VaR 



1 


13 906.79 


13 886.83 


1.43 • 10~ 3 


2 


3 337.36 


3 360.42 


6.88- 10~ 3 


3 


430.68 


433.29 


6.05- 10~ 3 


4 


299.79 


301.61 


6.03- 10~ 3 


5 


524.13 


520.70 


6.56- 10~ 3 



([TO]) and must be found; the rate of convergence to the non-Gaussian limit 
distributions is much slower and even a purely numerical analysis is much 
more computationally expensive. 

6. Comparison with an alternative model 

Let us now consider an alternative model with a slightly different dynam- 
ics: 

k{t) = Si (t) ■ m{t) , (33) 

where Si(t) is drawn from some distribution, which we are leaving unspecified 
for the moment, and independently from the noise, while 

fk(t) = & [ib J v C v(t) +&i + Zi(t) ) • (34) 

This dynamics involves only the variables rii(t), while the time dependence 
in Si(t) should be intended has a mere label to distinguish different values 
drawn from the same distribution. li(t) has still the meaning of the loss of the 
process % at the time t and, according to (133]) . is nonzero if rii(t) is equal to 
one, i.e. if the argument of the Heaviside function in (1341) is larger than zero, 
which is the same condition under which Z,(£) in (CQ) is nonzero. The dynamics 
of rii(t) controls the number of time steps in which the losses occur, while the 
amount of the losses depends on the distribution of Si(t). From this point 
of view, such a dynamics model can be considered a generalization of the 
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LDA, and it makes sense to call frequency the stochastic process associated 
to rii(t) and severity the random variable associated to Si(t). Models based on 
a dynamics similar to the one defined by fl33|) and (1M|) have been introduced 



in |2ll . |22| . Since frequency and severity are independent, one can imagine 
to fix the severity distribution so that the dynamics of (1331) is equivalent to 
the one of ([I]) on average, in the sense that /.;(£) and li(t) have the same 
distribution; it can be done by choosing the severity so that: 

Pr\s (t) > x] - FrUt) > X] - Fl[k{t) > X] (35) 

where the r.h.s. can be calculated once that all the parameters (6, J, and A) 
are known. As a consequence, the dynamics defined by (1331) and (1M|) appears 
more general than the one defined by ([T|), since, at least in principle, it per- 
mits to use arbitrary severity distributions, even the ones not satisfying (135|) . 
However, using an arbitrary severity distribution in general does not allow 
to employ the procedure described in Section fl~3l to estimate the parameters 
of the noise. 

Actually it is possible to show that fixing the mean of the severity so that: 

<«<*» = S§- (36) 

is sufficient to estimate A». For a free process i and for 8i < one has that: 



(*(«)> 



( ni (t))t tfMt ,_ 

z_M i 1 { } 

t ' (1 - Pr [k = | C i3 = 0, Vj]) A, ' 



where (TTU|) has been used, and the last equalities derive respectively from the 
self- averaging property of the cumulative loss and from (l29]) rl With a little 
thought this result can be easily extended to the more general case in which 
the matrix J has no causal loopso Nevertheless, there is no solid argument 



In the case of a noise distribution different from the one in ([3]) one has to verify that, 
like in (|37j) . {si(t)) depends only on the parameters of the noise. 

2 The calculation of (rii(t)) is similar to the one of (h(t)) that has been carried out in 
Section [3] and in the Appendix. 
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that justifies to impose the constraint (1361) on the severity. In particular, it is 
not clear why the mean should be the only moment of the severity depending 
on the parameters of the model (more specifically of the noise). Therefore, it 
would be desirable to have a coherent model that allows to derive (|36|) . and 
thus to estimate the parameters of the noise. The crucial observation is that 
the constraint (j3"5|) . and consequently the model whose dynamics is defined 
by ([I]), implies the constraint (l3"oT) . Hence, the model whose dynamics is 
defined by ([1]) has the virtue that it allows to estimate the parameters of the 
noise in a coherent way, both in the sense that the constraint (136]) has not 
to be imposed "by hand" and in the sense that all the terms appearing in 
the equation of motion have a clear interpretation (explicitly corresponding 
to a mechanism for producing or avoiding operational losses, as explained in 
Section [2]). Moreover, it is worth to point out that, in the case in which (135|) 
holds, the severity has a much clearer relationship with the parameters of the 
model; it is straightforward to showa that for a free process i and for Q, L < 0: 



Fs ^ {x) = l-K^-BA 



1 ~ F Ut)(~ 6 i 



(38) 



i.e. that the severity has the excess distribution of the noise over the threshold 
6i (with F y (x) = Pr[y < x] we denote the distribution of the random variable 
y, evaluated in x). As regards the more general case in which the matrix J 
has no causal loops, it is still possible to show that a generalized version of 
(|38|) holds, where both the numerator and the denominator are replaced by 
linear combinations whose coefficients depends on the topology of the graph 
associated to the matrix J, similarly to the cases treated in the Appendix. 

7. Conclusions 

In this paper we proposed a dynamical model to forecast operational 
losses in banks. The equation of motion provides two different mechanisms 
for the generation of losses in a process: the interaction with other processes 
and the spontaneous generation due to a random noise; since the different- 
time correlations play a crucial role in this context, the interactions are non- 
local in time; the effort made by the bank to avoid the occurrence of losses 



3 Using (l35|) and noting that the distribution of li(t) can be calculated observing that 
Pr [Ji(i) >x} = J °° G[k(t) - x]d£i(t). 
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is also taken into account by means of an inhomogeneous external field. We 
have shown that, if the coupling matrix J is known to have no causal loops, 
all the parameters of the model except the maximum times of correlations 
t% can be estimated from real data, so that the model can be tailored on the 
internal organizational structure of a specific bank; in the most general case 
also the parameters of the noise must be known a priori. Focusing on the case 
in which the coupling matrix J is known to have no causal loops, we exactly 
solve the model and find the asymptotic behaviour of the cumulative loss, 
showing that the non-locality of the equation of motion is not sufficient alone 
to modify the shape of the cumulative loss distribution. We specialize the 
procedure for estimating 6 and J suggested in |22| to the considered model, 
propose a procedure to estimate the parameters of the noise, and validate it. 
Many statistical approaches, like the static LDA, are founded on the 
implicit hypothesis that the basic statistical properties of the distributions 
of operational losses do not change in time; basing on this assumption the 
capital charge that the bank has to put aside to face operational risk the 
next year is calculated from the loss distribution built from historical data. 



The assumption made by the approach proposed here and in 21] is definitely 
weaker and consists in assuming that the basic mechanisms underlying the 
generation of operational losses do not change in time. The crucial advantage 
of such an approach is that it allows to make forecasts about future losses. 
The forecasting power of the model has been investigated estimating the 
parameters of the model only from a fraction / of a simulated database of 
operational losses and comparing the cumulative losses of the remaining part 
with those forecast by the model. We have shown that the model exhibits 
surprisingly good capabilities in forecasting the future losses even for / = 
0.75: in particular the relative error between the actual VaR (/ = 1) and the 
forecast VaR (/ = 0.75) is ~ 10~ 3 for all the processes. In order to check the 
performances of the proposed model, both the validation of the parameters 
estimation and the test of the forecasting power has been carried out using 
simulated data. 

We think that the general framework of dynamical models for operational 
risk deserves further investigation in several directions; let us just cite few 
examples: the case in which the coupling matrix has causal loops could be 
explored, more complex terms of interaction in the equation of motion could 
be considered or different mechanisms for the generation of losses included; 
as explained in Section [51 the study of the case in which the variance of 
distribution of the noise is not finite looks particularly promising as it may 
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lead to the emergence of heavy-tailed cumulative loss distributions. 

Appendix A. 

The results ((7j), (jSJ), (TTTj) and (Tl8|) will be extended in two particular cases. 
In the first case the process % is influenced only by the process j, which in 
turn is influenced only by the process k which is free (i •<— j -<— k). In this 
case the average over the noise is: 

h{t) J] dl 3 (t-s)dUt) 11 ^(t-r); (A.l) 



Ks<i* 2<r<t*+t*, 



the events Cy(t) = 0, . . . , Cij(t) = £*,• still cover the entire domain of i 



in- 



tegration, but are not mutually exclusive: in fact Cij(t) depends through 
l J (t-l),l j (t-2) 1 ...,l j (t-t* ij ) on C jk (t-l),C jk (t-2),...,C jk (t-t* ij ) which 
in turn have crossed dependences from l k (t — 2), l k {t — 3), . . . , l k {t — t*j — t* k ) 
so that, for example, both Cj k (t — 1) and Cj k (t — 2) depend on l k (t — 3). 
However, it is still possible to rewrite (1A.1|) in the following way: 



Ramp J tj V c s , +9i + £i(t) d£i{t) 
ici \ S '=i J 

■ I n <&(* - s ) n «*(* 

J{e[«j(t-s")]=c s //} s // 1<s<t *. 2<r<t*.+t* 

— — z j — — i] ' ]k 



— r) 



(A.2) 



j2 m (( J vJ2 Cs ' +e i) 

{c} s'=l 

■ I n «fe(* - s ) n «*(* 



{©Pi(*-s")]=c s "} s » 1< S <(«. 2<r<t*.+t* 

— — ij — — i] ]k 



where the sum over {c} is over all the possible configurations C\ G {0, 1}, ... , 
Q* G {0, 1}. Once a particular configuration {c} has been assigned, the inte- 
gral on the right hand side of ( 1A.2J) is simply the probability that G [lj(t — s')] 
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c s «, for s" = 1, . . . , £*j and equals to: 



^■fe+E/,: i e[/ fc (i- S "-r')]+0 J +? J (t-s'') 



n ^ 



-s 



-c.il ( l< s <t*. 
.11 v 



][ «*(< 



E 

(4 



{° 



2<r<t*+i* 
— — v ' jfc 



^■fc+E r // = j/'rfr"+ e j+5j(*-*")|=c s » f ^ i<«<t; 



|ep»(t-r')]=d P /} ^ 



e\h(t-r')]=d r , , 2 <r<t*.+t* 



n ^(*- s ) 

J] d&(*-r), (A.3) 



where again the sum over {d} is analogous to the sum over {c} and r' = 
2, . . . , £*• + t* k . We notice that integrals on the right hand side of (1A.3J) are 
decoupled and can be respectively rewritten as: 



n 



s + t* 
■> 3 
■i r l =a 



i< s <t*: •'©I Jj*+E r # = i* drz+flj+feC*-*) 



<&•(*- 5) 



n 



!<-<$ 



«+^* 



s+t 



Pf (^i* 5Z dr ' + ^'J'W + X ~Pj[ J Jk ^2 dr ' + d i) Sc °>° 



jk 



2<r<t*.+t*. 



(A.4) 



pf(^)4„,i+ l-pf(0fc) *c.,0 



(A.5) 



II / dUt~r)= 11 

2<r<t* J+ t* fc ■/ep fc (t-r)]=d r 2<r<i£. 

Using flA~2l) . (IA~3l . ([Oil and (JX31) one finally obtains: 

s'=l 

s+t *jk 

r'=s r'=s 

II tf(^)^i+[l-pf(^)l^,o , (A.6) 



M 



e n 

{4 !<«<*& 



s+t 



J c s ,0 



2<r<t^+t* fc 
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while the variance is easily obtained from (1A.6|) by replacing mf with ml 
and subtracting (Zj(t)) 2 . The value of Aj can again be estimated from (!T5|) . 
(IA.6|) , (I2"5j) and f[2"5j) , analogously to (150]) . This case can be trivially extended 



to all the graphs which are simple paths and contain m nodes, i.e. to all the 
graphs of the type i\ •<— ii •<—... -f- z m _i «— i m . We point out that in (1A.6I) 
one has to sum 2 t v + i>° terms; in the case in which the simplest loop were 
present in the graph associated with the matrix J, i.e. a loop of the process 
i with itself (Ja 7^ 0), it is easy to argue that the number of terms to sum 
in order to calculate (h(t)) would be equal to 2**«; in fact, such a topology 
is equivalent to a simple path containing t copies of the process i. 

In the second case that we will consider the process i is influenced only 
by two processes j% and j'2 that are both free. In this case U(t) depends only 
on l h {t - 1), . . . , l h {t - t* h ) through C ih {t) and on l J2 (t - 1), . . . , l J2 (t - t* h ) 
through Cij 2 (t), so that the average over the noise equals to the average over 
the random variables &(£), ^(t-1), . . . , ^ (£-%), £ h (t-l), . . . , £j 2 (t-t* j2 ) 
and (HTJ and (JI2J read: 

poo 

(k(t)) = / Ramp [J ih C in it) + J iJ2 C iJ2 (t) + 9i + &(*)] 
Jo 

■ 11 di^t-Si) J] dl J2 {t-s 2 )dUt), (A.7) 



dinit-s) J] / dl 32 {t-s) = 

1 s- 1 Ks<t*. ^° 

— — VI — — iJ2 

t*. \ / t*. 

VI f \ I V2 p 

e / n 4fa(* - a i) • e / n ^ - *o 

oi=0*' C '«i( t ) =cl K«i<t*. / \cfO C '»W" C2 l<S2<t'. 

— — VI / \ — — *J2 

(A.* 

where the domain of integration of the variables ^ (£— 1) , . . . , ^ (t—t*, ) , £^ a (£— 
1), . . . , £,j 2 (t — t*j 2 ) has been divided in subsets with fixed values of Cij x (t) and 
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Cij 2 (t). Inserting (1A.8I) and (TTB|) into (jA.7[) one obtains: 






(^)) = E( t f)bf 1 (^)] cl [i-pf 1 (^ 



Cl =0 v Cl 
t*. 
" 2 '*?,, 



4*. — ci 
Vl 



E 



c 2 r-i F//1 M*i,„-C2 



(A.9) 



Kfe)] 2 [l-Pf 2 (^)] 



C2=0 x 

■ mf{c x J ih + c 2 J ii2 + 4 ) . 

As in the aforementioned case, the variance is obtained from (1A.9I) by re- 
placing mf with ml and subtracting (Zj(£)) 2 . Analogously to f[3"0"j) . the 
value of Aj can be estimated from (Tl5|) . ( 1A.9J) . ([23]) and (128|) and the con- 
straint t*,Jij l + t* - 2 Jij 2 < \9i\ emerges. This case can be also trivially ex- 
tended to all the graphs in which the process i is influenced by an arbi- 
trary number of (say m) free processes, leading to the general constraint 

hji Jiji + • • • + Hj m Jijm < I "i I ■ 

In the more general case in which the graph representing the interactions 

has no loops both (h(t)) and var/j(t) are sums over all the simple paths 

starting from a leaf node and ending to the node i which can be calculated 

combining the extensions to the first and second case treated in the Appendix. 

Also in this general case both (h(t)) and varZj(t) do not depend on time and 

are finite, allowing to extend the results (1291) and (l30l) of Section 14.31 
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